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Abstract. Complementation of Biichi automata, required for checking automata con- 
tainment, is of major theoretical and practical interest in formal verification. We consider 
two recent approaches to complementation. The first is the rank-based approach of Kupfer- 
man and Vardi, which operates over a DAG that embodies all runs of the automaton. This 
approach is based on the observation that the vertices of this DAG can be ranked in a 
certain way, termed an odd ranking, iff all runs are rejecting. The second is the slice-based 
approach of Kahler and Wilke. This approach tracks levels of "split trees" - run trees in 
which only essential information about the history of each run is maintained. While the 
slice-based construction is conceptually simple, the complementing automata it generates 
are exponentially larger than those of the recent rank-based construction of Schewe, and 
it suffers from the difficulty of symbolically encoding levels of split trees. 

In this work we reformulate the slice-based approach in terms of run DAGs and pre- 
orders over states. In doing so, we begin to draw parallels between the rank-based and 
slice-based approaches. Through deeper analysis of the slice-based approach, we strongly 
restrict the nondeterminism it generates. We are then able to employ the slice-based ap- 
proach to provide a new odd ranking, called a retrospective ranking, that is different from 
the one provided by Kupferman and Vardi. This new ranking allows us to construct a 
deterministic-in-the-limit rank-based automaton with a highly restricted transition func- 
tion. Further, by phrasing the slice-based approach in terms of ranks, our approach affords 
a simple symbolic encoding and achieves the tight bound of Schewe's construction. 
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1. Introduction 

The complementation problem for nondeterministic automata is central to the automata- 
theoretic approach to formal verification |Var0 7b] . To test that the language of an automa- 
ton A is contained in the language of a second automaton 23, check that the intersection of 
A with an automaton that complements 23 is empty. In model checking, the automaton A 
corresponds to the system, and the automaton 13 corresponds to a property |VW86j . While 
it is easy to complement properties given as temporal logic formulas, complementation of 
properties given as automata is not simple. Indeed, a word w is rejected by a nondeterminis- 
tic automaton A if all runs of A on w reject the word. Thus, the complementary automaton 
has to consider all possible runs, and complementation has the flavor of determinization. 
Representing liveness, fairness, or termination properties requires automata that recognize 
languages of infinite words. Most commonly considered are nondeterministic Biichi au- 
tomata, in which some of the states are designated as accepting, and a run is accepting 
if it visits accepting states infinitely often [P3uc62] . For automata on finite words, deter- 
minization, and hence also complementation, is done via the subset construction [RS59J. 
For Biichi automata the subset construction is not sufficient, and optimal complementation 
constructions are more complicated |Var 07a]. 

Efforts to develop simple complementation constructions for Biichi automata started 
early in the 60s, motivated by decision problems of second-order logics. Biichi suggested a 
complementation construction for nondeterministic Biichi automata that involved a Ramsey- 
based combinatorial argument and a doubly-exponential blow-up in the state space |Buc62| . 
Thus, complementing an automaton with re states resulted in an automaton with 2 2 ° n) 
states. In [SVW87] . Sistla et al. suggested an improved implementation of Biichi's con- 
struction, with only 2°( n ) states, which is still not optimal. Only in [Saf88] Safra introduced 
a determinization construction, based on Safra trees, which also enabled a 2°( nl °s n ) com- 
plementation construction, matching a lower bound described by Michel [Mic88] . A careful 
analysis of the exact blow-up in Safra's and Michel's bounds, however, reveals an expo- 
nential gap in the constants hiding in the 0() notations: while the upper bound on the 
number of states in the complementary automaton constructed by Safra is n 2n , Michel's 
lower bound involves only an re! blow up, which is roughly (re/e) ra . In addition, Safra's 
construction has been resistant to optimal implementations AT\Y0(i. THB95j . which has 
to do with the complicated combinatorial structure of its states and transitions, which can 
not be encoded symbolically. 

The use of complementation in practice has led to a resurgent interest in the exact 
blow-up that complementation involves and the feasibility of a symbolic complementation 
construction. In 2001, Kupferman and Vardi suggested a new analysis of runs of Biichi 
automata that led to a simpler complementation construction |KV01| . In this analysis, 
one considers a DAG that embodies all the runs of an automaton A on a given word w. 
It is shown in |KV01] that the nodes of this DAG can be mapped to ranks, where the 
rank of a node essentially indicates the progress made towards a suffix of the run with 
no accepting states. Further, all the runs of A on w are rejecting iff there is a bounded 
odd ranking of the DAG: one in which the maximal rank is bounded, ranks along paths 
do not increase, paths become trapped in odd ranks, and nodes associated with accepting 
states are not assigned an odd rank. Consequently, complementation can circumvent Safra's 
determinization construction along with the complicated data structure of Safra trees, and 
can instead be based on an automaton that guesses an odd ranking. The state space of such 
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an automaton is based on annotating states in subsets with the guessed ranks. Beyond the 
fact that the rank-based construction can be implemented symbolically [TV07], it gave rise 
to a sequence of works improving both the blow-up it involves and its implementation in 
practice. The most notable improvements are the introduction of tight rankings [FKV06] 
and Schewe's improved cut-point construction [Sch09]. These improvements tightened the 
(6n) n upper bound of |KV01| to (0.76n) n . Together with recent work on a tighter lower 
bound [Yan06| . the gap between the upper and lower bound is now a quadratic term. 
Addressing practical concerns, Doyen and Raskin have introduced a useful subsumption 
technique for the rank-based approach [DR09J. 

In an effort to unify Biichi complementation with other operations on automata, Kahler 
and Wilke introduced yet another analysis of runs of nondeterministic Biichi automata 
[KW08| . The analysis is based on reduced split trees, which are related to the Miiller-Schupp 
trees used for determinization [MS95]. A reduced split tree is a binary tree whose nodes are 
sets of states as follows: the root is the set of initial states; and given a node associated with 
a set of states, its left child is the set of successors that are accepting, while the right child 
is the set of successors that are not accepting. In addition, each state of the automaton 
appears at most once in each level of the binary tree: if it would appear in more than one set, 
it occurs only in the leftmost one. The construction that follows from the analysis, termed 
the slice-based construction, is simpler than Safra's determinization, but its implementation 
suffers from similar difficulties: the need to refer to leftmost children requires encoding of a 
preorder, and working with reduced split trees makes the transition relation between states 
awkward. Thus, as has been the case with Safra's construction, it is not clear how the slice- 
based approach can be implemented symbolically. This is unfortunate, as the slice-based 
approach does offer a very clean and intuitive analysis, suggesting that a better construction 
is hidden in it. 

In this paper we reveal such a hidden, elegant, construction, and we do so by unifying 
the rank-based and the slice-based approaches. Before we turn to describe our construction, 
let us point to a key conceptual difference between the two approaches. This difference has 
made their relation of special interest and challenge. In the rank-based approach, the ranks 
assigned to a node bound the visits to accepting states yet to come. Thus, the ranks refer 
to the future of the run, making the rank-based approach inherently nondeterministic. In 
contrast, in the slice-based approach, the partition of the states of the automaton to the 
different sets in the tree is based on previous visits to accepting states. Thus, the partition 
refers to the past of the run, and does not depend on its future. 

In order to draw parallels between the two approaches, we present a formulation of the 
slice-based approach in terms of run DAGs. A careful analysis of the slice-based approach 
then enables us to reduce the nondeterminism in the construction. We can then employ this 
improved slice-based approach in order to define a particular odd ranking of rejecting run 
DAGs, called a retrospective ranking. In addition to revealing the theoretical connections 
between the two seemingly different approaches, the new ranks lead to a complementa- 
tion construction with a transition function that is smaller and deterministic in the limit: 
every accepting run of the automaton is eventually deterministic. This presents the first 
deterministic-in-the-limit complementation construction that does not use determinization. 
Determinism in the limit is central to verification in probabilistic settings [CY95J and has 
proven useful in experimental results [S T03] , Phrasing slice-based complementation as an 
odd ranking also immediately affords us the improved cut-point of Schewe, the subsumption 
operation of Doyen and Raskin, and provides an easy symbolic encoding. 
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2. Preliminaries 

A nondeterministic Biichi automaton on infinite words (NBW for short) is a tuple A = 
(X, Q, Q m , p, F), where S is a finite alphabet, Q a finite set of states, Q m CQa set of initial 
states, FCQa set of accepting states, and p:QxE->2^a nondeterministic transition 
relation. A state q G Q is deterministic if for every a G E it holds that \p(q,a)\ < 1. 
We lift the function p to sets R of states in the usual fashion: p(R,a) = \J qeR p{q,o-). 
Further, we lift p to words word a$ ■ ■ ■ Oi by defining p(R, • • • <7j) = p{p{R, oo), 0"i ■ ■ ■ cij). 
For completeness, let p(R, e) = R. 

An infinite run of an NBW A on a word w = oq0i • • • € S w is an infinite sequence of 
states po,Pi, ■ ■ • G Q w such that €= Q m and, for every i > 0, we have pi + \ G p(pi,ai). 
Correspondingly, a /irate ran is a finite sequence of states po, . . . ,p n such that pq G Q m 
and, for every < i < n, we have pi + \ G p(pi,ai). When unspecified, a run refers to an 
infinite run. A run is accepting iff G F for infinitely many i G IN. A word u> G S w is 
accepted by A if there is an accepting run of A on w. The words accepted by A form the 
language oiA, denoted by L(A). The complement of L(A), denoted L(A), is T, U '\L(A). We 
say an automaton is deterministic in the limit if every state reachable from an accepting 
state is deterministic. Converting A to an equivalent deterministic in the limit automaton 
involves an exponential blowup [CY951 ISaf88| . One can simultaneously complement and 
determinize in the limit, via co-determinization into a parity automaton [Pit06], and then 
converting that parity automaton to a deterministic-in-the-limit Biichi automaton, with a 
cost of (n 2 /e) n . 

Run dags: Consider an NBW A and an infinite word w = a^a\ ■ ■ ■ . The runs of A on w 
can be arranged in an infinite DAG (directed acyclic graph) G = (V, E) , where 

• V Q Q x IN is such that (q, i) G V iff some finite or infinite run p of A on w has pi = q. 

• E C I J {Qx{i}) x (Qx{i+1}) is s.t. E({q,i),{q J ,i+l)) iff (q,i) G V and q' G p(q,<Ji). 

The DAG G, called the run DAG of A on w, embodies all possible runs of A on w. We are 
primarily concerned with initial paths in G: paths that start in Q m x {0}. Define a node 
(q, i) to be an i^-node when q G F, and a path in G to be accepting when it is both initial 
and contains infinitely many i^-nodes. An accepting path in G corresponds to an accepting 
run of A on w. When G contains an accepting path, call G an accepting run DAG, otherwise 
call it a rejecting run DAG. We often consider DAGs H that are subgraphs of G. A node u 
is a descendant of v in H when u is reachable from v in H. A node v is finite in H if it has 
only finitely many descendants in H. A node v is F-free in H if it is not an F-node, and 
has no descendants in H that are F- nodes. We say a node splits when it has at least two 
children, and conversely that two nodes join when they share a common child. 

Example 2.1. In Figured] we describe an NBW A that accepts words with finitely many 
letters b. On the right is a prefix of the rejecting run DAG of A on w = babaabaaabaaaa 

Rank-Based Complementation: If an NBW A does not accept a word w, then every 
run of A on w must eventually cease visiting accepting states. The notion of rankings, fore- 
shadowed in [Kla90] and introduced in [KVOlj . uses natural numbers to track the progress 
of each run in the DAG towards this point. A ranking for a DAG G = (V, E) is a mapping 
from V to IN, in which no F-node is given an odd rank, and in which the ranks along all 
paths do not increase. Formally, a ranking is a function r: V — > IN such that if u G V is 
an i^-node then r(u) is even; and for every u,v G V, if (u, v) G E then r(u) > r(v). Since 
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Figure 1. Left, the NBW A, in which all states are initial. Right, the re- 
jecting run DAG G of A on w = babaabaaabaaaa ■ ■ ■ . Nodes are superscripted 
with the prospective ranks of Section [2j 



each path starts at a finite rank and ranks cannot increase, every path eventually becomes 
trapped in a rank. A ranking is called an odd ranking if every path becomes trapped in 
an odd rank. Since -F-nodes cannot have odd ranks, if there exists an odd ranking r, then 
every path in G must stop visiting accepting nodes when it becomes trapped in its final, 
odd, rank, and G must be a rejecting DAG. 

Lemma 2.2. |KV01| If a run DAG G has an odd ranking, then G is rejecting. 

A ranking is bounded by I when its range is {0, /}, and an NBW A is of rank I when 
for every w L(A), the rejecting DAG G has an odd ranking bounded by I. If we can 
prove that an NBW A is of rank I, we can use the notion of odd rankings to construct a 
complementary automaton. This complementary NBW, denoted A l R , tracks the levels of 
the run DAG and attempts to guess an odd ranking bounded by I. An l-bounded level ranking 
for an NBW A is a function /: Q — > {0, . . . , I, _L}, such that if q € F then f(q) is even or 
_L. Let R l be the set of all l-bounded level rankings. The state space of A l R is based on the 
set of /-bounded level rankings for A. To define transitions of A l R , we need the following 
notion: for a £ £ and /, /' £ say that /' follows f under a when for every q € Q and 
q' S p(q,a), if f(q) ^ _!_ then f'(q') ^ _L and f'(q') < f{q)'- i.e. no transition between / and 
/' on a increases in rank. Finally, to ensure that the guessed ranking is an odd ranking, we 
employ the cut-point construction of Miyano and Hayashi, which maintains an obligation 
set of nodes along paths obliged to visit an odd rank |MH84| . For a level ranking /, let 
even(f) = {q \ f(q) is even} and odd(f) = {q j /(g) is odd}. 

Definition 2.3. For an NBW A = (S, Q, Q in , p, F) and I G IN, define A l R to be the NBW 

x 2®,(f in ,(l)),p R ,Jl l x {0}), where 

• f m {o.) = I f° r each q E Q m , J- otherwise. 

' {(/', p{0, a) \ odd(f')) | /' follows / under a} if O ^ 0, 
{(/', even(f')) \ f follows / under a} if O = 0. 



PR((f,0),a) 



By |KV01| . for every / G IN, the NBW A l R accepts only words rejected by A — exactly 
all words for which there exists an odd ranking with maximal rank I. In addition, [KV01| 
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proves that for every rejecting run DAG there exists a bounded odd ranking. Below we 
sketch the derivation of this ranking. Given a rejecting run DAG G, we inductively define 
a sequence of subgraphs by eliminating nodes that cannot be part of accepting runs. At 
odd steps we remove finite nodes, while in even steps we remove nodes that are F-fcee. 
Formally, define a sequence of subgraphs as follows: 

• G = G. 

• G 2 i+i = G 2i \ {v | v is finite in G 2 ,}. 

• G 2 j+2 = G 2 i+i \ {v | v is F-free in G 2 i+i\- 

It is shown in [GKSV03, IKVOlj that only m = 2\Q \ F\ steps are necessary to remove 
all nodes from a rejecting run DAG: G m is empty. Nodes can be ranked by the last graph 
in which they appear: for every node u G G, the prospective rank of u is the index i such 
that u G Gi but u G" Gi + \. The prospective ranking of G assigns every node its prospective 
rank. Paths through G cannot increase in prospective rank, and no F-node can be given an 
odd rank: thus the prospective ranking abides by the requirements for rankings. We call 
these rankings prospective because the rank of a node depends solely on its descendants. 
By [KV01], if G is a rejecting run DAG, then the prospective ranking of G is an odd ranking 
bounded by m. By the above, we thus have the following. 

Theorem 2.1. |KV01| For every NBW A, it holds that L(A%) = L(A). 

Example 2.4. In Figure [TJ nodes for states s and t are finite in Go- With these nodes 
removed, r-nodes are F-iree in G\. Without r-nodes, g-nodes are finite in G 2 . Finally, 
p- nodes are F-free in G3. 

Karmarkar and Chakraborty have derived both theoretical and practical benefits from 
exploiting properties of this prospective ranking: they demonstrated an unambiguous com- 
plementary automaton that, for certain classes of problems, is exponentially smaller than 
A% [KC09] . 

Tight Rankings: For an odd ranking r and / G IN, let max -rank (r, I) be the maximum 
rank that r assigns a vertex on level I of the run DAG. We say that r is iig/iQ if there exists 
an i G IN such that, for every level I > i, all odd ranks below max-rank(r, I) appear on level 
/. It is shown in [FKV06] that the retrospective ranking is tight. This observation suggests 
two improvements to A 7 ^. First, we can postpone, in an unbounded manner, the level in 
which it starts to guess the level ranking. Until this point, A 1 ^ may use sets of states to 
deterministically track only the levels of the run DAG, with no attempt to guess the ranks. 
Second, after this point, A 1 ^ can restrict attention to tight level rankings - ones in which 
all the odd ranks below the maximal rank appear. Formally, say a level ranking / with 
a maximum rank maxjrank = max{/(g) | q G Q, f(q) 7^ J_} is tight when, for every odd 
i < maxjrank, there exists a q G Q such that f(q) = i. Let 3?™ be the subset of 3? m that 
contains only tight level rankings. The size of is at most (0.76n) n [FKV06] . Including 
the cost of the cut-point construction, this reduces the state space of A^ to (0.96n) n . 



^This definition of tightness for an odd ranking is weaker that of [FKV06] . but does not affect the resulting 
bounds. 
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3. Analyzing dags With Profiles 

In this section we present an alternate formulation of the slice-based complementation 
construction of Kahler and Wilke jKW08] . Whereas Kahler and Wilke approached the 
problem through reduced split trees, we derive the slice-based construction directly from 
an analysis of the run DAG. This analysis proceeds by pruning G in two steps: the first 
removes edges, and the second removes vertices. 

3.1. Profiles. Consider a run DAG G = {V,E). Let the labeling function A: V — > {0, 1} 
be such that A((q,i)) = 1 if q £ F and A((q,i)) = otherwise. Thus, A labels F-nodes by 1 
and all other nodes by 0. The profile of a path in G is the sequence of labels of nodes in the 
path. The profile of a node is then the lexicographically maximal profile of all initial paths 
to that node. Formally, let < be the lexicographic ordering on {0, 1}* U {0, 1} W . The profile 
of a finite path b = vo,Vi, . . . , v n in G, written hb, is A(vq)A(vi) ■ ■ ■ A(v n ), and the profile 
of an infinite path b = Vq,vi, ... is hb = A(vq)A(vi) ■■■ . Finally, the profile of a node v, 
written h v , is the lexicographically maximal element of {hb \ b is an initial path to v}. The 
lexicographic order of profiles induces a preorder over nodes. 

We define the sequence of preorders X$ over the nodes on each level of the run DAG as 
follows. For every two nodes u and v on a level i, we have that u v if h u < h v , and u ~j v 
if h u = h v . For convenience, we conflate nodes on the ith level of the run DAG with their 
states when employing this preorder, and say q^r when (q, i) <i (r, %). Note that is an 
equivalence relation. Since the final element of a node's profile is 1 iff the node is an F-node, 
all nodes in an equivalence class must agree on membership in F. We call an equivalence 
class an F-class when all its members are F-nodes, and a non- F-class when none of its 
members is an F-node. We now use profiles in order to remove from G edges that are not 
on lexicographically maximal paths. Let G' be the subgraph of G obtained by removing all 
edges (u, v) for which there is another edge (u', v) such that u -<\ u \ u'. Formally, G' = (V, E') 
where E' = E\ {(u, v) \ there exists u' £ V such that (u', v) € E and u -<\ u \ u'}. 

Lemma 3.1. For every two nodes u and v, if (u,v) £ E' , then h v € {h u 0,h u l}. 

Proof. Assume by way of contradiction that h v £ {h u 0,h u l}. Recall that h v is the lexico- 
graphically maximal element of {hb \ b is an initial path to v}. Thus our assumption entails 
an initial path b to v so that hb > h u l. Let u' be b\ u i: the node on the same level of G as u. 
Since b is a path to v, it holds that (u', v) G E. Further, hb > h u l, it must be that h u i > h u . 
By definition of E', the presence of (u',v) where h u i > h u precludes the edge (u,v) from 
being in E' — a contradiction. □ 

Note that while it is possible for two nodes with different profiles to share a child in G, 
Lemma [3.11 precludes this possibility in Q . If two nodes join in G' , they must have the same 
profile and be in the same equivalence class. We can thus conflate nodes and equivalence 
classes, and for every edge (it, v) G E', consider [v] to be the child of [u]. Lemma I3TT1 then 
entails that the class [u] can have at most two children: the class of F-nodes with profile 
h u l, and the class of non-F-nodes with profile h u 0. We call the first class the F-child of 
[u], and the second class the non-F-child of [u]. 

By using lexicographic ordering we can derive the preorder for each level i +1 of the run 
DAG solely from the preorder for the previous level i. To determine the relation between 
two nodes, we need only know the relation between the parents of those nodes, and whether 
the nodes are F-nodes. Formally, we have the following. 
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Lemma 3.2. For all nodes u,v on level i, and nodes u',v' where E'(u,u') and E'(v,v'): 

• If u -<i v, then v! <i-\.\ v' . 

• If u ~j v and either both u' and v' are F -nodes, or neither are F -nodes, then v! v' . 

• // u ~j v and v' is an F-node while v! is not, then u' -<i+i v' . 

Proof. If u -<i v, then h u < h v and, by Lemma 13,11 we know that h u < G {h u 0, h u l} must be 
smaller than h v i G {h v 0, h v l}, implying that u' ^j+i v' . If u ~j v, we have three sub-cases. 
If v 1 is an F-node and u' is not, then h u i = h u = h v < h v l = h v > and u' ^i+i v' . If both 
u' and v' are F- nodes, then h u i = h u l = h v l = h v > and u' ~j v' . Finally, if neither u' nor 
v' are F-nodes, then h u / = h u = h v = h v > and u' ~j v' . □ 

We now demonstrate that by keeping only edges associated with lexicographically max- 
imal profiles, G' captures an accepting path from G. 

Lemma 3.3. G' has an accepting path iff G has an accepting path. 

Proof. In one direction, if G' has an accepting path, then its superset G has the same path. 

In the other direction, assume G has an accepting path. Consider the set P of accepting 
paths in G. We prove that there is a lexicographically maximal element tt G P. To begin, 
we construct an infinite sequence, P$, Pi, . . ., of subsets of P such that the elements of Pi 
are lexicographically maximal in the first i + 1 positions. If P contains paths starting in 
an .F-node, then Pq = {b \ b G P, bo is an F-node} is all elements beginning in F-nodes . 
Otherwise Pq = P. Inductively, if Pi contains an element b such that is an F-node, then 
Pi + i = {b | b £ Pi, bi + i is an F-node}. Otherwise Pi+\ = Pi. For convenience, define the 
predecessor of Pi to be P if i = 0, and Pi-\ otherwise. Note that since G has an accepting 
path, P is non-empty. Further, every set Pi is not equal to its predecessor P' only when 
there is a path in P' with an F-node in the ith position. In this case, that path is in F,. 
Thus every Pi is non-empty. 

First, we prove that there is a path tt G Hix) Consider the sequence Uq, U\, TJi, . . . 
where Ui is the set of nodes that occur at position i in runs in Fj. Formally, Ui = 
{u | u G G, b G Pi, u = bi}. Each node in Ui+i has a parent in Ui, although it may not 
have a child in Ui + 2- We can thus connect the nodes in Ui>o ^* *° their parents, forming a 
sub-DAG of G. As every Pi is non-empty, every Ui is non-empty, and this DAG has infinitely 
many nodes. Since each node has at most n children, by Koriig's Lemma there is an initial 
path 7r through this DAG, and thus through G. We now show by induction that tt G Pi for 
every i. As a base case, tt G P. Inductively, assume tt is in the predecessor P' of Pi. The 
set Pi is either P' , in which case tt G Pi, or the set {b \ b G P', bi is an F-node}. In this 
latter case, as Uj consists only of F-nodes, the node TXi must be an F-node. and tt G Pi. 

Second, having established that there must be an element tt G C\i >0 Pi, we prove tt 
is lexicographically maximal in P. Assume by way of contradiction that there exists an 
accepting path tt' so that h n i > h n . Let k be the first point where h n i differs from h n . 
At this point, it must be that tt^ is not an F node, while Tr' k is an F node. However, 
tt' is an accepting path that shares a profile with tt up until this point. As tt is in the 
predecessor P' of P&, it must also be that tt' is in P' . By definition, P^ then would be 
{b | b G P' , bk is an F-node}. This would imply tt ^ Pfe, a contradiction. 

Finally, we demonstrate that every edge in tt occurs in G' . Assume by way of contradic- 
tion that some edge (-7Tj, 7Tj+i) is in E but not in E' . This implies there is a node u on level 
i such that (u, TTi+\) is in E and 7Tj -<i u. Since u G G, there is an initial path b to u. Thus, 
the path b, u, TTi+\, vrj+2 ... is an accepting path in G. This path would be lexicographically 
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larger than tt, contradicting the second claim above. Hence, we conclude tt is an accepting 
path in G' . □ 

In the next stage, we remove from G' finite nodes. Let G" = G' \ {v \ v is finite in G'}. 
Note there may be nodes that are not finite in G, but are finite in G' . It is not hard to see 
that G may have infinitely many F-nodes and still not contain a path with infinitely many 
F-nodes. Indeed, G may have infinitely many paths each with finitely many F-nodes. We 
now show that the transition from G via G' to G" removes this possibility, and the presence 
of infinitely many F-nodes in G" does imply the existence of a path with infinitely many 
F-nodes. 

Lemma 3.4. G has an accepting path iff G" has infinitely many F-nodes. 

Proof. If G has an accepting path, then by Lemma 13.31 the DAG G' contains an accepting 
path. Every node in this path is infinite in G' , and thus this path is preserved in G" . This 
path contains infinitely many F-nodes, and thus G" contains infinitely many F-nodes. 

In the other direction, we consider the DAG over equivalence classes induced by G" . 
Given a node u in G" , recall that its equivalence class in G" contains all states v such that 
v € G" and h u = h v . Given two equivalence classes U and V, recall that V is a child of 
U when there are u £ U , v £ V , and E"(u,v). As mentioned above, once we have pruned 
edges not in G', two nodes of different classes cannot join. Thus this DAG is a tree. Further, 
as every node u in G" is infinite and has a child, its equivalence class must also have a child. 
Thus the DAG of classes in G" is a leafless tree. The width of this tree must monotonically 
increase and is bounded by n. It follows that at some level j the tree reaches a stable width. 
We call this level j the stabilization level of G. 

After the stabilization level, each class U has exactly one child: as noted above, U 
cannot have zero children, and if U had two children the width of the tree would increase. 
Therefore, we identify each equivalence class on level j of G" with its unique branch of 
children in G", which we term its pipe. These pipes form a partition of nodes in G" after j. 
Every node in these pipes has an ancestor, or it would not be in the DAG, and has a child, 
or it would not be infinite and in G" . Therefore each node is part of an infinite path in this 
pipe. Thus, the pipe with infinitely many F-classes contains only accepting paths. These 
paths are accepting in G, which subsumes G". □ 

In the proof above we demonstrated there is a stabilization level j at which the number 
of equivalence classes in G" stabilized, and discussed the pipes of G": the single chain of 
descendants from each equivalence class on the stabilization level j of G". 

Example 3.5. Figure [2] displays G" for the example of Figure [TJ Edges removed from 
G' are dotted: at levels 1 and 3 where both q and r transition to r. When both r and 
s transition to t, they have the same profile and both edges remain. The removed edges 
render all but the first (/-node finite in G' . The stabilization level is 0. 

3.2. Complementing With Profiles. We now complement A by constructing an NBW, 
As, that employs Lemma 13.41 to determine if a word is in L(A). This construction is a 
reformulation of the slice-based approach of [KW08] in the framework of run DAGs: see 
Appendix [A] The NBW As tracks the levels of G' and guesses which nodes are finite in 
G' and therefore do not occur in G" . To track G' , the automaton .As stores at each point 
in time a set S of states that occurs on each level. The sets S are labeled with a guess of 
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{?,*} {p,r,s} 
W >-! {q,t} >t {p} 
{r,s} y 2 {q,t} y 2 {p} 

{*} ^3 M y 3 {q} y 3 M 

{t} y t {r,s} ^ 4 {5} ;- 4 {p} 



Figure 2. The run dag G" , where dotted edges were removed from G and 
dotted states were removed from G' . Nodes are superscripted with their 
A-labels. Bold lines denote the pipes of G" . The lexicographic order of 
equivalence classes for each level of G' is to the right. 

which nodes are finite and which are infinite. States that are guessed to be infinite, and thus 
correspond to nodes in G" , are labeled T, and states that are guessed to be finite, and thus 
omitted from G" , are labeled _L. In order to track the edges of G', and thus maintain this 
labeling, As needs to know the lexicographic order of nodes. Thus As also maintains the 
preorder -<i over states on the corresponding level of the run DAG. To enforce that states 
labeled _L are indeed finite, As employs the cut-point construction of Miyano and Hayashi 
[MH84], keeping an "obligation set" of states currently being verified as finite. Finally, to 
ensure the word is rejected, As must enforce that there are finitely many F-nodes in G" . 
To do so, Sa uses a bit b to guess the level from which no more F- nodes appear in G" . 
After this point, F- nodes must be labeled J_. 

Before we define As, we formalize preordered subsets and operations over them. For a 
set Q of states, define Q = {(S, ^) [ S C Q and ^ is a preorder over S} to be the set of 
preordered subsets of Q. Let {S, ^} be an element in Q. When considering the successors 
of a state, we want to consider edges that remain in G' . For every state q G S and a G E, 
define p^ s ^)(q,cr) = {r G p(q,&) | for every q' G S, if r G p(q',a) then q' •< q}. Now define 
the a -successor of (S, ■<) as the tuple (z^S", <r), ^'), where for every q,r G 5, q' G P(s,~<) (l? 
and r' G p {s ^)(r,a): 

• If q -< r, then q' -<' r' 

• If q ~ r and either both r' G F and q' G F, or both r' F and q' F, then q' ~' r' . 

• If q ~ r and one of q' and r', say r' , is in F while the other, q' , is not, then q' ~<f r' . 

We now define As- The states of As are tuples (S,^,X,0,b) where: (S,^) G Q is 
preordered subset of Q; A: 5 — > {T, _L} is a labeling indicating which states are guessed to 
be finite (_L) or infinite (T); O C S is the obligation set; and b G {0, 1} is a bit indicating 
whether we have seen the last F-node in G" . To transition between states of A s , say that 
H = (S', A', O', b') follows t = {S, ^, A, O, b) under a when: 

(1) (S', is the u-successor of {S, X). 

(2) A' is such that for every q G S: 

• If X(q) = T, then there exists r G p^s : -<)(q,o-) such that A'(r) = T, 
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• If X(q) = X, then for every r G p(g^(q,a), it holds that A'(r) = _L. 

(3) a = /^ e ° p ( s >$( q ' <r ') ° ^ ' 

\{g I 9 G 5' and \'{q) = 1} = 0. 

(4) 6' > 6. 

We want to ensure that runs of reach a suffix where all F-nodes are finite. To this end, 
given a state of As {S, <,X,0,b), we say that A is F-free if for every q G S n F we have 
A(<?) = ±. 

Definition 3.6. For an NBW A = {T,,Q,Q in , p,F), let A s be the NBW (E,Qs, Qf, ps, F S ), 
where: 

• Qs = {{S,±,\,0,b) | if b = 1 then A is F-free}, 

• = {(Q™, A, 0, 0} | for all ?,re Q m , g ^ r iff g F or r G F}, 

• ps(t, a) = {t' | t' follows t under a}, and 

• F S = {(S,^,A,0,1}}. 

We divide runs of As into two parts. The prefix of a run is the initial sequence of states 
in which bi is 0, and the suffix is the remaining sequence states, in which hi is 1. A run 
without a suffix, where b stays for the entire run, has no accepting states. 

Theorem 3.1. For every NBW A, it holds that L{A S ) = L(A). 

Proof. Consider a word w G S w and the run DAG G. We first make the following claims 
about every infinite run to,ti,..., where tj = {Si, ^i, Aj, Oj, bi). For convenience, define 
§i = {Si, r^i)- 

(1) T/ie states in Si are precisely {q \ (q,i) G G}. 

We exploit this claim to conflate a state q in the ith state with the node (q, i), and speak 
of states in Si being in, being finite in, and being infinite in a graph G. 

(2) The preorder ^ is the projection of ■< onto states occurring at level i. 

This follows from Lemma 13.21 and the definition of one state in As following another. 

(3) For every p G Si, q G Si + i, it holds that q G ps^p,^) iff E'((p,i), (q,i +1>). 
This follows from the definitions of E' and p§- 

(4) Oj is empty for infinitely many indices i iff every state labeled _L is not in G". 
This follows from the cut-point construction of Miyano and Hayashi. [MH84J. 

(5) Every state labeled T is in G" . 

This follows from the definition of transitions between states: every T-labeled state 
must have a T-labeled child, and thus is infinite in G' and in G" . 

We can now prove the theorem. In one direction, assume there is an accepting run to, ti, 

As this run is accepting, infinitely often Oj = 0. Byd]and[5j this implies the states in Si 
are correctly labeled T when and only when they occur in G" . Further, for this run to be 
accepting we must be able to divide the run into a prefix, and suffix as specified above. In 
the suffix no state in F can be labeled T, and thus no F-nodes occur in G" past this point. 
As only finitely many F-nodes can occur before this point, by Lemma 13.31 G does not have 
an accepting path and w L(A). 

In the other direction, assume w £ L(A). This implies there are finitely many F-nodes 
in G" , and thus a level j where the last F-node occurs. We construct an accepting run 
to,ti, . . ., demonstrating along the way that we satisfy the requirements for tj+i to be in 
ps(tj,o"j). Given w, the sequence {So, r^o)> {Si, ^i), . . . of preordered subsets is uniquely 
defined by ps- There are many possible labelings A. For every i, select Aj so that a state 
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q G Si is labeled with T when (q,i) G G", and _L when it is not. Since every node in G" 
has a child, by O for every p G Si where Xi(p) = T, there exist a g G PSiiPi&i) so that 
= T. Further, every node labeled _L has only finitely many descendants, and so for 
every p G Sj where \i(p) = _L and q G p% t {p, it holds that Aj+i(g) = _L. Therefore the 
transition from Aj to Aj+i satisfies the requirements of ps- The set Oq = 0, and given the 
sets Si and labelings Aj, the sets Oj+i, i > are again uniquely defined by p^. Finally, we 
choose bi = when i < j, and bi = 1 for i > j. Since there are no i* 1 - nodes in G" past j, 
no -F-node will be labeled T and all states past j will be F-free. We have satisfied the last 
requirement for the transitions from every tj to tj+i to be valid, rendering this sequence a 
run. ByUl infinitely often Oj = 0, including infinitely often after j, thus there are infinitely 
many states tj where bi = 1 and Oi = 0, and this run is accepting. □ 

If n = \Q\, the number of preordered subsets is roughly (0.53n) n [Var80| . As there are 
2™ labelings, and a further 2™ obligation sets, the state space of A s is at most (2n) n . The 
slice-based automaton obtained in [KW08J coincides with As, modulo the details of labeling 
states and the cut-point construction (see Appendix [A]) . Whereas the correctness proof in 
[KW08| is given by means of reduced split trees, here we proceed directly on the run DAG. 

4. Retrospection 

Consider an NBW A. So far, we presented two complementation constructions for A, 
generating the NBWs A 1 ^ and As- In this section we present a third construction, generating 
an NBW that combines the benefits of the two constructions above. Both constructions 
refer to the run DAG of A. In the rank-based approach applied in A 1 ^, the ranks assigned 
to a node bound the visits in accepting states yet to come. Thus, the ranks refer to the 
future, making A 1 ^ inherently nondeterministic. On the other hand, the NBW As refers to 
both the past, using profiles to prune edges from G, as well as to the future, by keeping in 
G" only nodes that are infinite in G'. Guessing which nodes are infinite and labeling them 
T inherently introduces nondeterminism into the automaton. 

Our first goal in the combined construction is to reduce this latter nondeterminism. 
Recall that a labeling is .F-free if all the states in F are labeled _L. Observe that the fewer 
labels of _L (finite nodes) we have, the more difficult it is for a labeling to be F-iree and, 
consequently, the more difficult it is for a run of As to proceed to the F-hee suffix in which 
6=1. It is therefore safe for As to underestimate which nodes to label _L, as long as the 
requirement to reach an F-free suffix is maintained. We use this observation in order to 
introduce a purely retrospective construction. 

For a run DAG G, say that a level k is an F '-finite level of G when all F-nodes after 
level k (i.e. on a level k! where k' > k) are finite in G'. By Lemma |3.4| G is rejecting iff 
there is a level after which G" has no F- nodes. As finite nodes in G' are removed from G" , 
we have: 

Corollary 4.1. A run DAG G is rejecting iff it has an F-finite level. 
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4.1. Retrospective Labeling. The labeling function A used in the construction of As 
labels nodes by {T, _L}, with _L standing for "finite" and T standing for "infinite". In this 
section we introduce a variant of A that again maps nodes to {T, _L} except that now T 
stands for "unrestricted", allowing us to underestimate which nodes to label _L. To capture 
the relaxed requirements on labelings, say that a labeling A is legal when every _L-labeled 
node is finite in G' . This enables the automaton to track the labeling and its effect on F- 
nodes only after it guesses that an F-finite level k has been reached: all nodes at or before 
level k (i.e. on a level k' where k' < k) are unrestricted, whereas F-nodes after level k and 
their descendants are required to be finite. The only nondeterminism in the automaton lies 
in guessing when the F- finite level has been reached. This reduces the branching degree of 
the automaton to 2, and renders it deterministic in the limit. 

The suggested new labeling is parametrized by the F-finite level k. The labeling \ k is 
defined inductively over the levels of G. Let Si be the set of nodes on level i of G. For 
i > 0, the function A fc : Si — >■ {T, _L} is defined as follows: 

• If i < k, then for every it 6 Si we define X k (u) = T. 

• If i > k, then for every u E Sf. 

— If u is an F-node, then X k (u) = _L. 

— Otherwise, X k (u) = X k (v), for a node v where E'(v,u). 

For X k to be well defined when i > k and u is not an F-node, we need to show that X k (u) 
does not depend on the choice of the node v where E'(v,u) holds. By Lemma 13.11 all 
parents of a node in G' belong to the same equivalence class. Therefore, it suffices to prove 
that all nodes in the same class share a label: for all nodes u and it', if v! ~m u then 
X k (u) = X k (u'). The proof proceeds by an induction on i = \u\. Consider two nodes u and 
v! on level % where v! ~j u. As a base case, if i < A;, then u and v! are labeled T. For 
% > k, if u is an F-node, then u' is also an F-node and X k (u) = X k (u') = _L. Finally, if u 
and v! are both non-F-nodes, recall that all parents of u are in the same equivalence class 
V. As u ~j u' , Lemma 13.11 implies that all parents of v! are also in V. By the induction 
hypothesis, all nodes in V share a label, and thus X k (u) = X k {u'). 

Lemma 4.1. For a run DAG G and k G IN, the labeling X k is legal iff k is an F -finite level 
forG. 

Proof. If X k is legal, then every _L-labeled node is finite in G' . Every F-node after level k 
(i.e. on a level i where i > k) is labeled _L, and thus k is an F-finite level for G. If X k is 
not legal, then there is a _L-labeled node u that is infinite in G' . Every ancestor of u is also 
infinite. Let u' be the earliest ancestor of u (possibly u itself) so that X k (u') = _L. Observe 
that only nodes after level k can be _L-labeled, and so v! is on a level i > k. It must be 
that v! is an F-node: otherwise it would inherit its parents' label, and by assumption the 
parents of u' are T-labeled. Thus, u' is an F-node after level k that is infinite in G' , and k 
is not an F-finite level for G. □ 

Corollary 4.2. A run DAG G is rejecting iff, for some k, the labeling X k is legal. 

4.2. From Labelings to Rankings. In this section we derive an odd ranking for G from 
the function X k , thus unifying the retrospective analysis behind X k with the rank-based 
analysis of |KV01j . Consider again the DAG G' and the function X k . Recall that every 
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equivalence class U has at most two child equivalence classes, one -F-class and one non-F- 
class. After the .F-finite level k, only non-F-classes can be labeled T. Hence, after level 
k, every T-labeled equivalence class U can only have a one child that is T-labeled. For 
every class U on level k, we consider this possibly infinite sequence of T-labeled non-F- 
children. The odd ranking we are going to define, termed the retrospective ranking, gives 
these sequences of T-labeled children odd ranks. The T-labeled classes, which lie between 
these sequences of T-labeled classes, are assigned even ranks. The ranks increase in inverse 
lexicographic order, i.e. the maximal T-labeled class in a level is given rank 1. As with X k , 
the retrospective ranking is parametrized by k. The primary insight that allows this ranking 
is that there is no need to distinguish between two adjacent T-labeled classes. Formally, we 
have the following. 

Definition 4.2 (^-retrospective ranking). Consider a run DAG G, k G IN, and the labeling 
X k : G — > {T, T}. Let m = 2\Q \ F\. For a node u on level i of G, let a(u) be the number 
of T-labeled classes lexicographically larger than it; a(u) = \{[v] \ X k (v) = T and u -<i v}\. 
The k- retrospective ranking of G' is the function r k : V — > {0..m} defined for every node u 
on level i as follows. 

m if i < k, 

2a(u) if i > k and X k (u) = T, 
2a(u) + 1 if i > k and X k (u) = T. 



r k (u) 



Note that r k is tight. As defined in Section [21 a ranking is tight if there exists an i G IN 
such that, for every level I > i, all odd ranks below max -ranker, I) appear on level I. For 
r k this level is k + 1, after which each T-labeled class is given the odd rank greater by two 
than the rank of the next lexicographically larger T-labeled class. 

Lemma 4.3. For every k G IN, the following hold: 

(1) If u -<\ u \ u' then r k (u) > r k (u'). 

(2) If(u,v) G E' , then r k (u) > r k (v). 

Proof. As both claims are trivial when u is at or before level k, assume u is on level i > k. 
To prove the first claim, note that a(u) > a(u'): every class, T-labeled or not, that is 
larger than v! must also be larger than u. If a(u) > a(u'), then (1) follows immediately. 
Otherwise a(u) = a(u'), which implies that X k (u r ) = T: otherwise [u'] would be a T- 
labeled equivalence class larger than u, but not larger than itself. Thus r k (u') = 2a(u), and 
r k (u) G {2a(u), 2a(n) + l} is at least r k (u'). 

As a step towards proving the second claim, we show that a(u) > a(v). Consider every 
T-labeled class [v'] where v -<i+\ v' . The class [v'] must have a T-labeled parent [u']. Since 
v -<i+i v 1 , the contrapositive of Lemma l3.2| part 1, entails that u ^ u' . By the definition 
of X k , the class [u ; ] can only have one T-labeled child class: [v'}. We have thus established 
that for every T-labeled class larger than v, there is a unique T-labeled class larger than u, 
and can conclude that a(u) > a(v). We now show by contradiction that r k (u) > r k (v). For 
r k (u) < r k (v), it must be that a(u) = a(v), that r k (u) = 2a(u), and that r k (v) = 2a(u)+l. 
In this case, X k (u) = T and X k (v) = T. Since a T-labeled node cannot have a T-labeled 
child in G' , this is impossible. □ 



When k is an F-finite level of G, the fc-retrospective ranking is an m-bounded odd 
ranking. 
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Lemma 4.4. For a run DAG G and k G IN", the function r k is a ranking bounded by m. 
Further, if the labeling X k is legal then r k is an odd ranking. 

Proof. There are three requirements for r k to be a ranking bounded by m: 

(1) Every F-node must have an even rank. At or before level k, every node has rank m, 
which is even. After k only T-labeled nodes are given odd ranks, and every F-node is 
labeled _L. 

(2) For every (u,v) G E, it must hold that r k (u) > r k (v). If u is at or before level k, then 
it has the maximal rank of m. If u is after level k, we consider two cases: edges in 
E', and edges in E \ E' . For edges in E' , this follows from Lemma 14.31 (2). For edges 
(u, v) G E \ E' , we know there exists a u' where u -<i u i v! and (u',v) G E' . By Lemma 
Pj r k (u) > r k (u') > r k (v). 

(3) The rank is bounded by m. No -F-node can be T-labeled. Thus the maximum number 
of T-labeled classes on every level is The largest possible rank is given to a node 
smaller than all T-labeled classes, which must be be a F-node and T-labeled. Thus, 
this node is given a rank of at most m = 2\Q \ F\. 

It remains to show that if X k is legal, then r k is an odd ranking. Consider an infinite path 
uq,ui, ... in G. We demonstrate that for every i > k such that r k (u{) is an even rank e, 
there exists i' > i such that r k (ui>) ^ e. Since a path cannot increase in rank, this implies 
r k (v,i>) < e. To do so, define the sequence Ui, E/i+i, . . ., of sets of nodes inductively as 
follows. Let Ui = {v | r k (v) = e}. For every j > i, let Uj + \ = {v \ v' G Uj, (v',v) G E'}. As 
r k (v) is even only when X k (v) = T, if X k is legal then every node given an even rank (such 
as e) must be finite in G' . Therefore every element of Ui is finite in G' , and thus at some 
i' > i, the set Uy is empty. Since Uy is empty, to establish that r k (ui>) ^ e, it is sufficient 
to prove that for every j, if r k (uj) = e, then Uj G Uj. 

To show that r k (uj) = e entails Uj G Uj, we prove a stronger claim: for every j > i and 
v on level j, if Uj <j v and r fc (t>) = e, then v £ Uj. We proceed by induction over j. For the 
base case of j = i, this follows from the definition of U. For the inductive step, take a node 
v on level j +1 where r k (v) = e and Uj + \ r^+i v. We consider two cases. If r k (uj + \) ^ e 
then the path from U{ to Uj+i entails that Y k (uj+i) < e, and this case of the subclaim 
follows from Lemma 14.31 (1). Otherwise, it holds that r fc (n J+ i) = e, and thus r k (uj) = e. 
Let u' and v' be nodes on level j so that {u' ,Uj + \) G E' and G E' . As u^+i ^j+i 

the contrapositive of Lemma l3.2( part 1, entails that u' <j v' . Further, since (v! ' ,Uj + \) G E' 
and (uj,Uj + \) G E, we know Uj -<j v! . By transitivity we can thus conclude that Uj -<j v' , 
which along with Lemma 14.31 (1) entails r k (u') = e > r (v'). As (v',v) G E, Lemma 14.31 (2) 
entails that r k (v') > r k (v) = e. Thus r k (v') = e, and by the inductive hypothesis v' G Uj. 
As E'(v',v) holds, by definition v G C/j+i, and our subclaim is proven. □ 

The ranking of Definition 14.21 is termed retrospective as it relies on the relative lexico- 
graphic order of equivalence classes; this order is determined purely by the history of nodes 
in the run DAG, not by looking forward to see which descendants are infinite or F-free in 
some subgraph of G. 

Example 4.5. Figure [3] displays A and the O-retrospective ranking of our running example. 
In the prospective ranking (Figure [5]) , the nodes for state t on levels 1 and 2 are given rank 
0, like other t-nodes. In the absence of a path forcing this rank, their retrospective rank is 
2. 
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{<?,*} {p,r,s} 
M >-i {q,t} f-i {p} 
{r,s} y 2 {q,t} y 2 M 
{*} ^3 W y 3 {q} y s M 

{t} y 4 {r, s} ^ 4 {?} >-4 {p} 



FIGURE 3. The run dag G', where is an F-finite level. The labels of A and 
ranks in r° are displayed as superscripts and subscripts, respectively. The 
bold lines display the sequences of T-labeled classes in G' . The lexicographic 
order of states is repeated on the right. 



We are now ready to define a new construction, generating an NBW Al, which combines 
the benefits of the previous two constructions. The automaton Al guesses the F-finite level 
k, and uses level rankings to check if the /c-retrospective ranking is an odd ranking. We 
partition the operation of Al into two stages. Until the level k, the NBW Al is in the 
first stage, where it deterministically tracks preordered subsets. After level k, the NBW 
Al moves to the second stage, where it tracks ranks. This stage is also deterministic. 
Consequently, the only nondeterminism in Al is indeed the guess of k. Before defining Al, 
we need some definitions and notations. 

Recall that Q denotes the set of preordered subsets of Q, and 01™ the set of tight level 
rankings bounded by m. We distinguish between three types of transitions of Al- transitions 
within the first stage, transitions from the first stage to the second, and transitions within 
the second stage. The first type of transition is similar to the one taken in As, by means of 
the cr-successor relation between preordered subsets. Below we explain in detail the other 
two types of transitions. Recall that in the retrospective ranking r fc , each class in G' labeled 
T by \ k is given a unique odd rank. Thus the rank of a node u depends on the number of 
T-labeled classes larger than it, denoted a(u). 

We begin with transitions where Al moves between the stages: from a preordered 
subset (S, ^) to a level ranking. On level k + 1, a node is labeled T iff it is a non-F-node. 
Thus for every q € S, let f3(q) = \{[v] \ v € S \ F,u -< v}\ be the number of non-F-classes 
larger than q. We now define torank: Q — > 3?™. Let torank((5, <)) be the tight level 
ranking / where for every q: 

'± if qg S, 

f(q) = I 2P{q) if qeSCiF, 

2P(q)+l if q£S\F. 

We now turn to transitions within the second stage, between level rankings. The rank 
of a node v is inherited from its predecessor u in G' . However, A fc may label a finite 
class T. If a T-labeled class larger than u has no children, then a(u) > a(v). In this 
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case the rank of v decreases. Given a level ranking /, for every q £ Q where /(g) 7^ _L, 
let 7(g) = \{f{q') I q' G Q, f(q') is odd, f(q') < f{q)}\ be the number of odd ranks in the 
range of / lower than f(q). We define the function tighten: 3i m — > 3?™. Let tighten(/) 
be the tight level ranking /' where for every q: 

'-L if /(g) = J_, 

/V) = I 27(9) if f{q) + J- and q G F, 
2 7 (q)+l if /(g) + J_ and g F. 

Note that if / is tight, then /' = /, and that while tighten may merge two even ranks, it 
cannot merge two odd ranks. 

For a level ranking /, a letter a £ E, and g' £ Q, let pred(g', cr, /) = {q | /(g) 7^ _L, q' G 
/?(g,c)} be the predecessors of g' given a non-_L rank by /. The lowest ranked element in 
this set corresponds to the predecessor in G with the maximal profile. With two exceptions, 
q' will inherit this lowest rank. First, tighten might shift the rank down. Second, if q' is 
in F, it cannot be given an odd rank. For n G IN, let [n\ even be: n when n is even; and 
n— 1 when n is odd. Define the a -successor of f to be tighten(/') where for every q' G Q: 

'l ifpred(g>,/) = 0, 

f'(q') = I Lmin({/(g) | g G pred(g', a, f)})\ eV en if pred(g', a, /) J- and g' € F, 
min({/(g) | g G pred(g', a, /)}) if pred(g', a, /) ^ and g' F. 

Definition 4.6. For an NBW 71 = (T,,Q,Q in ,p, F), let .A L be the NBW 
(E,QU (3Bp x 2«),Q 4 L n ,/) L ,^ x {0}}, where 

• Qf = {(Q in , < in )} where < in is such that for all g, r £ Q in , g ^ r iff g F or r £ F. 

• pl(S,ct) = {§'} U {(torank(S'), 0)}, where §' is the u-successor of S. 

• PL{{f,0),a) = {(/', O'}} where /' is the u-successor of / 

a ndO'=i p{0 ^ )X ° dd{f) if ° /0 ' 
\even(f) if O = 0. 

The proof of Theorem 14.31 is based on Lemmas 12.21 and 14.41 and Corollary 14.21 



Theorem 4.3. For every NBW A, it holds that L(A L ) = L(A). 

Proof. Consider a word w G S w and the run DAG G. We first make the following claims 
about every infinite run (So, ^0), • • • , (Sk, {fk+i, Ofc+i}, {fk+2, O fc+2 ), . . .. For i > k, 
define Si = {q \ fi(q) / _L}. 

(1) The states in Si are precisely {q \ (q,i) G G}. 

This follows by the definitions of cr-successors of preordered subsets and cr-successors of 
level rankings. 

(2) The preorder ^ is the projection of ^ onto states occurring at level i. 
This follows from Lemma 13.21 and the definition of cr-successors. 

(3) For every i < k, state q G Si, and s G Si+i, it holds that s G p^.^.^g, <7j) iff 
E'((q,i),(s,i+l}). 

This follows from the definitions of E' and p^.^.y 

(4) For every i > k and g, s G Si, if fi(q) > fi(s), then (q,i) -<i (s,i). 

(5) For every i> k and g, s G Si, if fi(s) is odd and (q,i) -<i (s,i), then fi(q) > fi(s). 
This and U] are proven below. 



18 



S. FOGARTY, O. KUPFERMAN, T. WILKE, AND M. Y. VARDI 



(6) For every i> k and q € Si, it holds that fi(q) is even iff X k ((q,i)) = _L. 

This follows from the definition of X k , which assigns _L to F- nodes and their descendants 
in G', and fi, which assigns even ranks to states in F. BylH the parent of a node in 
G' will be the parent with the lowest rank. Thus the descendants of F-nodes in G' will 
inherit the even rank of their parent. 

We simultaneously prove H] and [5] by induction. As a base case, both hold from the definition 
of torank. As the inductive step, assume both hold for level i. To prove step HI take 
two states q,s € Si+l where fi+\{q) > /j+i(s). Each state has a parent in G' , i.e. a q' 
and s' so that E'(q',q) and E'(s',s). By the inductive hypothesis, this implies fi(q') = 
m±n({fi(q') \ q G p(q',0i)}) and fi(s') = min({/j(s') | s € p(s', We analyze two cases. 

When fi(q') > fi(s'), by the inductive hypothesis we have (q',i) <i (s',i). Since E'(q',q) 
and E'(s',s), by Lemma \3A\ this implies (q, i +1) -<i+i {s,i +1). Alternately, when fi(q') = 
fi(s'), then for fi+\(q) > fi+i(s) to hold, it must be that fi(q') is odd, s £ F, and q F. 
Since fi(q') = fi(s') is odd, by the inductive hypothesis we have that (q',i) = (s',i). By 
Lemma [3711 we then have h/ qii+ i\ = fy ? ',i)Q < fya.i+i) = h/gi^l. 

To prove step O consider when is odd and (q,i +1) -< (s,i +1). This implies 

that fys,i+i) = ^( s ',i)0. Thus in order for (q,i+l) -<i+i (s,i+l) to hold, (q',i) -<i (s',i) 
must hold. By the inductive hypothesis, this implies f%(q') > fi(s'). Before the tighten 
function reduces ranks, since fi + \{q) = [fi(q')\ e ven, and fi + i(s) is odd, it must be that 
fi+i(o) > fi+i( s )- The tighten function can shift fi + \{q) down more than fi + \{s) only 
when an odd rank between and becomes empty. Since this odd rank must 

be two greater than /j + i(s), reducing fi+i(q) by 2 cannot change that fi+\{q) > /j+i(s). 
We now proceed with the proof of Theorem 14.31 

In one direction, assume the run (S , :^o), ■ ■ ■ , (<Sfc, ^k), (fk+i, O k +i), {fk+2, Ofc+2>, • • • 
is accepting. We construct a ranking r of G as follows. For all nodes u on level i < k, 
r(u) = m. For all nodes (q,i) where i > k, r((q,i)) = fi(q). We note that each state 
is given at most the minimum rank of all its parents, and that no state in F is given an 
odd rank, thus r is in fact a ranking. That r is an odd ranking follows from the cut-point 
construction. 

In the other direction, assume G is a rejecting run DAG. By Lemma [4.41 there exists a k 
so that r fc is an odd ranking. We construct a run So, ... , (fk+i, Ofc+i), {fk+2: Ok+2): ■ ■ 
which is uniquely defined by the transition relation of Definition 14. 61 Further, the transition 
relation of Definition 14.61 is total, so this run is infinite. To demonstrate that this run is 
accepting, we will prove below that for every % > k and q € Si, it holds that fi(q) = 
r k ((q,i)). Since r k is an odd ranking and the cut-point construction is identical to that of 
Definition 12.31 this is sufficient to show the run is accepting. 

Recall that if X((q,i)) = _L, then r k ((q,i)) = 2a((q,i}), and otherwise r k ((q,i)) = 
2a((s,i)) +1. We can thus use [6] to simplify our claim. It suffices to show that for every 
i > k and q € Si, we have a((q,i)) = [fi(q)/2\. We proceed by induction over i > k. As 
the base case, consider a node (q, k). Recall that a((q, k)) = \{[v] | X k (v) = T, (q, k) -<k 
By the definition of X k , a node on level k is labeled _L only when it is an F-node. All 
other nodes inherit the label of their parents, and every node on level k is T-labeled. From 
[21 we then have that a((q, k + 1}) = \{[v] \ v G S \ F, u -< v}\, which is the definition of 
P(q)=[fi(q)/2\. 

Inductively, assume the claim holds for every q € Si. We show for every s € S*j+i, it 
holds that a((s, i +1)) = |_/i+i(s)/2j . Let q be the parent of s in G', i.e. E'(q,s). Take 
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the set P = {[v] \ X k (v) = T, (q,i) <i v}. of T-labeled equivalence classes greater than Q, 
By the inductive hypothesis, [fi(q)/2\ = a((q,i)) = \P\. By the definition of r fc , each 
[v] £ P has a unique odd rank assigned to each of its elements. By [5j for each [v] this 
odd rank is smaller than fi(q). Consider the subset of P given by P s = {[v] \ [v] G P, 
[v] has T-labeled child class on level i +1}. Define P e = P \ P s to be the complementary 
set: pipes that die on level i. By[5j before the tighten operation is applied, every element 
of P e has a corresponding odd rank that is unoccupied on level i +1 . Since q is clearly not in 
an element of P e , this odd rank must be less than \_fi(q) [even- Thus the final rank assigned 
to s, after tighten, is either fi(q) — 2\P e \ or |_/i(<z) — 2|P e |J euen . In both cases |_/i+i(s)/2j = 
[fi(q)/2\ — \P e \. By the inductive hypothesis this is equivalent to a({q, i)) — \P e \ = \P\ — \P e \. 
By the definition of P s and P e , \P\ — \P e \ = \P S \- By Lemma 13-H every T-labeled child of a 
class in P s is lexicographically larger than (s,i +1). As every T-labcled child must have a 
unique parent in P s , we conclude that \P S \ = a((s,i +1)). □ 

Analysis: Like the tight-ranking construction in Section [21 the automaton Al operates 
in two stages. In both, the second stage is the set of tight level rankings and obligation 
sets. The tight-ranking construction uses sets of states in the first stage, and is bounded by 
the size of the second stage: (0.96n) n [FKV06| . The automaton Al replaces the first stage 
with preordered subsets. As the number of preordered subsets is ^((rjjn?)™) ~ (0.53n)™ 
|Var80| . the size of Al remains bounded by (0.96n) n . This can be improved to (0.76n) n : 
see Section [5] and [Sch09] . Further, Al has a very restricted transition relation: states in 
the first stage only guess whether to remain in the first stage or move to the second, and 
have nondeterminism of degree 2. States in the second stage are deterministic. Thus the 
transition relation is linear in the number of states and size of the alphabet, and Al is 
deterministic in the limit. 

5. Variations on Al 

In this section we present two variations of Al'- one based on Schewe's variant of the rank- 
based construction that achieves a tighter bound; and one that is amenable to Tabakov and 
Vardi's symbolic implementation of the rank-based construction. Schewe's construction 
alters the cut-point of the rank-based construction to check only one even rank at a time. 
Doing so drastically reduces the size of the cut-point: intuitively, we can avoid carrying the 
obligation set explicitly. Instead we could carry the current rank i we are checking, and add 
to the domain of our ranking function a single extra symbol c that indicates the state is 
currently being checked, and thus is of rank i. For an analysis of the resulting state space, 
please see [Sch09] . For clarity , we do not remove the obligation set from the construction. 
Instead, states in this variant of the automaton carry with them the index i, and in a state 
(f,0,i), it holds that O C {q \ f(q) = i}. For a level ranking /, let mr(/) be the largest 
rank in /. Note that mr(/), for a tight ranking, is always odd. 

Definition 5.1. For an NBW A = (£, Q, Q in , p, F), let A Schewe be the NBW 
(E, QU(rx2«x N),Qf, PSch , F Sch ), where 

• PSch(S,°') = {(torank(S'), 0, 0)} U {§'}, where §>' is the a-successor of S. 

• p S ch((f,0,i),a) = {(f',0',i')} where 

/' is the cr-successor of / 
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i> = 



and a 



F Sch = X m x {0} x {0} 



i if O + { 

(i+2) mod (mr(f) +1) if O = [ 

jp(0,a)\odd(f) ifO/0, 

I {q I /'(?) = *'} if O = 0- 



Theorem 5.1. For every NBW .A, it holds that L(As c hewe) = L(A). 

Proof. Given a word w, we relate the runs of Aschewe and Al- As both automata are 
comprised of two internally deterministic stages, with a nondeterministic transition, each 
index k defines a unique run for each automaton. As the first stage of both automata are 
identical, and the second stage is deterministic, given a fixed k let 

Pl = (So, r<o), • • • , (Sk, r<fc), (fk+i, Ok+i), (fk+2, Ok+2), ■ ■ ■ 
be the run of Al on w that moves to the second stage on the kth transition, and let the 
corresponding run of Aschewe be 

PSch{So, ^0)7 • • • 1 {Sk, ^k), (fk+i, 0' k+1 ,n k+ i), (f' k+2 , 0' k+2 , n k+ 2), ■ ■ ■ 
We show that pl is accepting iff ps c h is accepting, or more precisely that pl is rejecting iff 
psch is rejecting. First, we note that the level rankings fk+i, fk+2, ■ ■ ■ and f' k+1 , f' k+ 2, • • • in 
both automata are defined by torank((5fc, and the u-successor relation, and thus for 
every j > k, it holds fj = f'-. 

In one direction, assume that psch is rejecting. This implies there is some j > k so that 
for every j' > j, O'-, is non-empty. In turn, this implies that there is a sequence qj, qj+i, ■ ■ ■ 
of states so that, for every j' > j, we have that qy £ O'-,, that fj>{qj') = rij, and that 
qj'+i G p(qj',Wji). If there is no / > j where Oi = 0, then we have that pl is rejecting. 
Alternately, if there is such a I > j, then qi + \ £ O/+1, and for every I' > I we have qy G Oy. 
Again, this implies pl is rejecting. 

In the other direction, assume that pl is rejecting. This implies there is some j > k so 
that for every j' > j the set Oy is non-empty. In turn, this implies that there is an even 
rank i and sequence qj,qj+\, ... of states so that, for every j' > j, we have that qy € Oy, 
that fy{qy) = i, and that qy+i G p(qy,wy). We now consider the indexes ny in psch- If 
there is some j' > j where ny = i, then for every I > j', it will hold that q\ G 0[, and 
PSch w iU be rejecting. Alternately, if there is no j' > j where ny = i, then it must be that 
the indexes ny stops cycling through the even ranks. This implies the obligation set stops 
emptying, and therefore that psch must be rejecting. □ 

To symbolically encode a deterministic-in-the-limit automaton, we avoid storing the 
preorders. To encode the preorder in a BDD as a relation would require a quadratic number 
of variables, increasing the size unacceptably. Alternately, we could associate each state with 
its index in the preorder. Unfortunately, calculating the index of each state in the succeeding 
preorder would require a global compacting step, to remove indices that had become empty. 
To handle this difficulty, we simply store only the subset in the first stage, and transition to 
an arbitrary level ranking when we move to the second stage. This maintains determinism in 
the limit, and cannot result in false accepting run: we can always construct an odd ranking 
from the sequence of level rankings. The construction and a small example encoding are 
provided below. 
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Definition 5.2. For an NBW A = (Y,,Q,Q in ,p, F), let A Symb be the NBW 
(£,2« U x 2Q),Q in ,ps ymb ,K m x {0}), where 

• PSymb(S,a) = {p(S,a)} U {</,0> | / € R m and for all q € Q, /(g) /liffge p(S,a)}. 

• PSymb((f,0),a) = p L ({f,0),a) 

Theorem 5.2. For every NBW A, it holds that L(A Sym b) = L(A). 

Proof. In one direction, assume w S L(A). This implies w 6 L(yix), and thus there exists 
an accepting run (S , ^ ), • • • , (Sfc, ^fc), </fc+i,Ofc+i>, {/fc+2, Ofc+2), • • • of .Al on We show 
that So,..., S k , (f k+ i, Ojt+i), (/fc+2, Ofc+2>, ••• is an accepting run of A.s,, m6 on w. We note 
that in the second stage transitions and accepting states in As ym b are identical to Al ■ Thus 
to show that this is an accepting run As ym b, we only need show that the run is valid from 
to k + 1, 

By definition, S*o = Q m is the initial state of As ym b- F° r every i, < i < k, it holds that 
Si+i = p{Si,Wi) £ Ps V mb{Si,Wi). Finally, consider the transition from S k to (fk+i,O k+1 ). 
Let (Sfc+i, ^fc+i) be the a-successor of (Sk,^k)- By definition, S^+i = p(Sk,Wk). By 
the transition relation of Al, we have f^+i = torank((5' / ! c+ i, ^k+i)) an d O^+i = 0- By 
the definition of torank, for every q G Q it holds that fk+i(l) = -L i& q £ "Sfc+i- Thus 
(fk+l, Ofc+i) e PSymb(Sk), and 5 , . . . , 5 fc , (/ fc+ i, O fc+ i>, </ fe+2 , O fc+2 >, • • • is an accepting run 
of As ym b on to. 

In the other direction, if w E L(As ym b), there is an accepting run So, . . . , Sk, (/fe+i, Ok+i), 
(/fc+2, Ofc+2>, • • • of As y mb on to. From this run we construct an odd ranking of G, which 
implies w G L(A). Define the ranking function r so that for every (q,i) £ G: if i < k then 
r ((?j*>) = m = 2\Q \ F\; and if i > k then r((q,i)) = fi(q). As demonstrated in the proof 
of Theorem 14.31 the definition of <r-successors and G implies that when i > k, it holds that 
fi(q) 7^ -L. Similarly, by the definition of cr-successors no path in G can increase in rank 
under r. We conclude that r is a valid ranking function. 

To demonstrate that r is an odd ranking, assume by way of contradiction that there 
is a path {qo,0}, (qi, 1), . . . in G that gets trapped in an even rank. Let j be the point at 
which this path gets trapped, or k + 1, whichever is later. If there is no f > j such that 
Oji = 0, then there is no accepting state after j, and the run would not be accepting. If 
there is such a j ' , then Oj'+i would contain qf+i, as /,-/+! is even. At every point 
j" > j' + 1, the obligation set will contain qyi , and thus there will be no accepting state 
after j', and the run would not be accepting. However, we have that the run is accepting 
as a premise. Therefore no path in G gets trapped in an even rank, r is an odd ranking, 
and by Lemma \Z2\w <F L(A). □ 

As an example, Figure H] is the SMV encoding of the complement of a two-state au- 
tomaton. 

6. Discussion 

We have unified the slice-based and rank-based approaches by phrasing the former in the 
language of run DAGs. This enables us to define and exploit a retrospective ranking, pro- 
viding a deterministic-in-the-limit complementation construction that does not employ de- 
terminization. Experiments show that the more deterministic automata are, the better 
they perform in practice [ST03J. By avoiding determinization, we reduce the cost of such a 
construction from (n 2 /e) n to (0.76n) n [Pit06j. 
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1; /* Size for complemented automaton: 2, maximum allowed rank = 2*/ 



typedef STATE 
module mainO { 

letter: {a,b}; 

rank : array STATE of . . 3 ; 

phase : . . 1 ; 

subset: array STATE of boolean; 
init(rank) := [2,2,2,2]; 
init(subset) := [1,1,1,1]; 
init (phase) := 0; 

next(phase) := {i : i=0..1, i >= phase}; 



/* The transition letter */ 
/* The value 3 represents bottom */ 

/* The phase of the automaton, ranks 2 or 3 in phase 0*/ 
/* The obligation set vector */ 
/* 2 to initial states, 3 to others */ 
/* initially rejecting */ 



/* Define the rank of states in the next time step. Cases fall through. */ 
/* state has transition from on a and b */ 
next (rank [0] ) := case { 

rank [0] =3 : 3; 

next (phase) =0 : 2; 

phase=0 k next(phase)=l : {i : i=0..2, i <= rank[0]}; 
phase=l : rank[0]; 

}; 



/* 1 has transition from 1 on a and from on b. 1 is accepting */ 
next (rank [1] ) := case { 

letter=a k rank [1] =3 : 3; 

letter=a k next (phase) =0 : 2; 

letter=a k phase=0 k next (phase) =1 

letter=a k phase=l : {i : i=0..2, : 

letter=b k rank [0] =3 : 3; 

letter=b k next (phase) =0 : 2; 

letter=b k phase=0 k next (phase) =1 

letter=b k phase=l : {i : i=0..2, : 

}; 



: {i : i=0..2, 
in {rank [1] , 



: {i : i=0..2, 
in {rank [0] , 



i <= rank[l] 
rank[l]-l} k : 



i <= rank[0] 
rank[0]-l} k : 



i in {0,2}}; 
in {0,2}}; 



i in {0,2}}; 
in {0,2}}; 



/* Defining the transitions of the P-set */ 
if (next (phase) =0) { 

forall (i in STATE) next (subset [i] ) := 1; 
} else { 

if (subset=[0,0,0,0] ) { /* The P-set is empty */ 

forall (i in STATE) next (subset [i] ) := next (rank [i] ) in {0,2}; 
} else { /* The P-set is non-empty */ 
if (letter=a) { 

next (subset [0] ) := (subset [0]) k next (rank [0] ) in {0,2}; 
next (subset [1] ) := (subset [1]) k next (rank [1] ) in {0,2}; 
} else { /* letter=b */ 

next (subset [0] ) := (subset [0]) k next (rank [0] ) in {0,2}; 
next (subset [1] ) := (subset [0]) k next (rank [1] ) in {0,2}; 

}}} 
SPEC 0; 
FAIRNESS subset=[0,0,0,0] ; 

} 



Figure 4. The SMV encoding of the As y mb, f° r the two-state automaton 
consisting of states p and q of Figure [TJ 



In addition, our transition generates a transition relation that is linear in the number 
of states and size of the alphabet. Schewe demonstrated how to achieve a similar linear 
bound on the transition relation, but the resulting relation is larger and is not deterministic 
in the limit [Sch09] . 



UNIFYING BUCHI COMPLEMENTATION CONSTRUCTIONS 



23 



As shown in Section EJ the use of level rankings affords several improvements from 
existing research on the rank-based approach. First, the cut-point construction of Miyano 
and Hayashi [MH84J can be improved. Schewe's construction only checks one even rank at a 
time, reducing the size of the state space to (0.76n) n , only an n 2 factor from the lower bound 
[SchOQj. As Schewe's approach does not alter the progression of the level rankings, it can 
be applied directly to the second stage of Definition 14.61 The resulting construction inherits 
the asymptotic state-space complexity of [Sch09]. Second, symbolically encoding a preorder 
is complicated. In contrast, ranks are easily encoded, and the transition between ranks is 
nearly trivial to implement in SMV |TV07] . By changing the states in first stage of Al from 
preordered subsets to simple subsets, and guessing the appropriate transition to the second 
stage, we obtain a symbolic representation while maintaining determinism in the limit. 
This approach sacrifices the linear-sized transition relation, but this is less important in a 
symbolic encoding. Finally, although not addressed in Section [5l the subsumption relations 
of Doyen and Raskin [DR09] could be applied to the second stage of Definition 14.61 while 
it is unclear if it could be applied at all to the slice-based construction. 

From a broader perspective, we find it very interesting that the prospective and retro- 
spective approaches are so strongly related. Odd rankings seem to be inherently "prospec- 
tive," depending on the descendants of nodes in the run DAG. By investigating the slice- 
based approach, we are able to pinpoint the dependency on the future to a single component: 
the F-free level. This suggests it may be possible to use odd rankings for determinization, 
automata with other accepting conditions, and automata on infinite trees. 
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Appendix A. Slices 

The paper of Kahler et al. introduces the notion of the split tree, reduced split tree, and 
skeleton of an automaton A and word w Trees are represented as prefix-closed non-empty 
subsets of {0, 1}*. In a tree V, a node vO is called the left child of v, and vl the right 
child of v. The root is e. A node v is said to be on level i when \v\ = i. For a set L, an 
L-labeled tree is a pair (V, I) where V is a tree and I : V — > L is a label function. By abuse 
of notation, for an L-labeled tree T = (V, I) and vertex v, say v G T when v G V, and let 
T(v) = l(v). For two nodes v and v' , say that v' > v when \v\ = \v'\ and v' is to the right 
of, i.e. lexicographically larger than, v. 

The split tree, written T sp , is the 2^-labeled tree defined inductively as follow^. As a 
base case, e G T sp and T sp {e) = Q in . Inductively, given a node v on level i, let P = T sp (v). 
If p(P, Wi) \ F ^ then € T sp and T s p(v0) = p(P, Wj )\F. S imilarly, if p(P, w t ) n F + 0, 
then vl £ T sp and T sp (?;l) = /o(P, iu^HF. As argued in [KW08] , branches in T sp correspond 
to runs of A on w. We gloss over this discussion and simply state that w € L{A) iff T sp (A, w) 
has a branch that goes right infinitely often. The split tree is analogous to G w . Each path 
p to node (q, i) G G w corresponds to a node v on level i of T sp that contains g in its label. 
Edges in G w correspond to edges in T sp , and thus paths in G w correspond to paths in T sp . 

Lemma A.l. For every state q and level i, (q,i) G G w iff there is at least one node v G T sp 
where \v\ = i and q G T sp (v). 

Proof. We prove this by simple induction over i. As the base case we have that e G T sp (v) 
and T sp (e) = Q m , while by definition (q, 0) G G w iff q G Q m . Thus our lemma holds for 
i = 0. 

Inductively, assume that the lemma holds for i = 1, and let g' £ Q. In one direction, if 
(q', i+1) G G w , then there is a run p so that pi + \ = q' . By the inductive hypothesis, there is a 
node v G T s p where \v\ = i and pi G T sp (t>). If g' F, then p(T ap (v),wi) \ F + 0, uO G T s p, 
and g' G T s P(uO) = p(T sp (v), Wi ) \ F. Similarly, if q' G F, then p(T sp (v),Wi) n F ^ 0, 
ul G T sp , and g' G r s P(vl) = p(T* (v),Wi) n F. 

In the other direction, if there is a node G T sp so that \v'\ = i+1 and q' G T sp (v), 
then has a parent t> so that \v\ = i. As q' G p(T sp (v),Wi), there is a state g G T sp (v) 
so that g' G p(q,Wi). By the inductive hypothesis, (g, i) G G^, and by definition g G 
p(Q m , wo ■ ■ ■ Wi-i). By the definition of a run, this implies q' G p(Q m ,wo ■ ■ ■ Wi), and thus 
(q',i + l)£G w . □ 

Lemma A. 2. For every q, q' , and i, it holds that (<g, i), (q' , i + i}) G F i/f £/iere are nodes v 
and v' in T sp so that \v\ = i, v' is a child of v, g G T sp {v), and q' G T sp (v'). 

Proof. In one direction, let g, g', and i be such that ((g, i), (q' , i + i)) G F. By the definition 
of F, we have g' G p(q,Wi). By Lemma lA. 1|, there is a node w G T sp so that \v\ = i and 
g G T sp (v). If g' ^ F, then let v' = vO, otherwise g' G F and let v' = vl. In either case that 
q' G p{q,Wi) implies that i/ G T sp and g' G T sp (v'). 

In the other direction, let g, g', and j be such that there are nodes v and v 1 in T sp 
where \v\ = i, v' is a child of v, q G T sp (v), and g' G T sp {v'). By Lemma lA.H we have that 
(g, i) G and (g', i + 1> G G w . By the definition of T sp , we have g' G p(q,Wi), and thus 
((9, t), ((/,« + *>) GF. □ 



Compared to [K W08| . these definitions reverse the left and right children. This was done to match the 
paper. 
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The reduced split tree, written T rs , keeps only the rightmost instance of each state at 
each level of the tree. This bounds the width of T rs to n. Formally, we define T rs inductively 
as follows. As a base case, the root e G T rs , and T rs (e) = Q in . Inductively, given a node 
v G T rs on level i, let P = T rs (v) and let P' = \J{p(T rs (v') I v' G T rs and v' < v}. If 
(p(P,Wi) \F)\P' ^ then vO G T rs and T rs (vO) = (p(P,Wi) \ F) \ P' . Similarly, if 
(p(P, Wi) n F) \ 0, then ul G T rs and T rs (wl) = (p(P, Wi ) nf)\ P'. The reduced split 
tree is analogous to the profiles of nodes in G w and the edges in G' . Since paths in G w 
correspond to paths in T sp , the lexicographically maximal path through G w to a node (q, i) 
corresponds to the rightmost path through T sp to an instance of q on level i. This is the 
only instance that remains in T rs . 

Lemma A. 3. For every node (q,i) G G' , there is a node v G T rs where \v\ = i and 
q G T rs (v). Further, hi q ^\ = Of. 

Proof. By Lemma I A. 11 for every (q,i). there is at least one node v' G T rs where |t>'| = i 
and q G T rs (v'). Let v be the rightmost such node. We must show that hi q ^\ = Ov, and 
we do so by induction over i. As a base case, we have that q G Q m , i = 0, and v = e. 
Since, by assumption, Q m n F = 0, we have h^q, 0} = = Of. Inductively, assume that this 
lemma holds for a fixed i, and let q' be such that (q' , i + 1} G G". Let b be if q' P, and 
1 if g' G P. Since there are no orphan nodes in G' , we know that there is a q on level i 
such that ({q,i), {q',i + 1>) G G". By Lemma IBTTj we know that h^q',i + 1} = h^q,i)b, and 
that (g, i> has the lexicographically maximal profile of all predecessors of {q',i + 1). By the 
inductive hypothesis, there is a node v G T rs so that \v\ = i, q G T rs (v), and = f- 

Since lexicographic maximality in profiles corresponds to being rightmost in the tree, this 
means v is the rightmost node containing q in T sp . Thus v b is the rightmost node containing 
q' in T sp , and the only node containing q' in T rs . □ 

Lemma A. 4. Por every q,q' , and i, it holds that ((q,i), {q',i + i)) G E' iff there are nodes 
v and v' in T rs so that \v\ = i, v' is a child of v, g £ T rs (v), and q' G T rs (v'). 

Proof. This follows from Lemma IA.3I and Lemma 13.11 □ 

Finally, the skeleton T sp is obtained by removing from the reduced split tree all nodes 
that are finite. As a corollary of Lemma lA.41 the skeleton is a representation of G" . The 
slice automaton of Kahler and Wilke proceeds by tracking the levels of T rs and guessing 
which nodes occur in T sp . Each level i of T rs is encoded as a slice, a sequence (Po, . . . , P m ) 
of pairwise disjoint subsets of Q. This slice is precisely the sequence of equivalence classes in 
level i of G' , indexed by their relative lexicographic ordering (see Figure [2]). The automaton 
of Kahler and Wilke differs from Definition 13.61 only in the details of labeling states and the 
cut-point construction. 
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